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18.1. Introduction. 



This chapter is a basic review of the fundamental principles of modern spectroscopy, 
designed to provide a research student access to a widespread field. We begin by describing 
the basic properties of spectrometers and emphasize the relative merits of certain techniques. 
Attention is given to recent developments in monochromators, in particular, narrowband 
and tunable filters. Extensive historical reviews are to be found elsewhere [1,2]. While 
spectroscopic techniques continue to evolve, they rely on either dispersion (refractive prisms) 
or multi-beam interference (diffractive gratings, interferometers). Prisms are of great 
historical importance [3,4,5], but their performance is now surpassed by transmissive and 
reflective gratings [6,7,8]. At the close of the 19th century, three important developments 
occurred. The curved grating removed the need for auxiliary optics, thereby extending 
spectroscopy into the far ultraviolet and infrared wavelengths [7]. This was soon followed 
by the development of the Fourier Transform [9] and Fabry-Perot [10] interferometers 
that have many uses today. Good general discussions can be found in [11,12,13]. More 
specialized treatments are as follows: prisms [11,14,15], gratings [2,16], Fabry-Perot [17,18] 
and Fourier Transform interferometers [19,20]. 



18.2. Basic Principles. 

The most useful figure of merit of a spectrometer is the product of the resolving power 
(TZ) and the throughput (T). The throughput is defined asT = A-Q where A is the normal 
area of the beam and fl is the solid angle subtended by the source. The resolving power is 
defined as 71 = \/6X where S\ is known as the spectral purity, or the smallest measurable 
wavelength difference at a given wavelength A. In a properly matched optical system. 



^Current Address: Anglo- Australian Observatory, P.O. Box 296, Epping, NSW 2121. 
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the throughput, or equivalently, the flux through the spectrometer depends ultimately on 
the entrance aperture and the area of the dispersive element. The theoretical limit to 
the resolving power is set by the characteristic dimension of the spectrometer (e.g. prism 
base). In practice, it is often advantageous to accept a lower value of 7^, for example, 
by widening the entrance slit to allow more light to enter the spectrometer. Indeed, if 
an observed spectral line is not diffraction limited, the flux through the spectrometer is 
inversely related to the resolving power of the system. Thus, it makes sense to compare the 
relative merits of spectrometers at the same effective resolving power. If we match the area 
of the dispersing element for each technique — prism, grating, Fabry-Perot etalon, Fourier 
Transform beam-splitter — we flnd that the solid acceptance angles of the latter techniques 
have a major throughput (Jacquinot) advantage over the others. 

18.2.1 Throughput Advantage. Jacquinot [21] demonstrated the relative merit of prisms, 
gratings and Fabry-Perot etalons on the basis of throughput. The simplest spectrometer 
comprises a collimator optic to equalize the optical path lengths of each ray between 
the entrance aperture and the disperser, and a camera optic to reverse the action of the 
collimator by imaging the dispersed light onto the detector. If the solid angle subtended 
by the object, or source, at the distance of the collimator is dQ, and the projected area of 
the collimator is dA, the radiant flux falling on the collimator from a source of brightness 
B is given by dF = B ■ dA ■ dVt. The quantity dA ■ dQ = dF/B is simply the throughput 
or the etendue of the system. In a properly matched (lossless) optical system, the etendue 
is a constant everywhere along the optical path, in which case the brightness of the source 
is equal to the brightness of the image. This is easily seen for a simple lens with focal 
length /. A small element dA at the focal (or object) distance subtending a solid angle 
dfl is brought to a focus at the image plane distance /' with area dA' subtending an 
angle dQ'. The linear magniflcation, both horizontally and vertically, is /'//• Therefore 
dA'/dA = {f'/fy. However, this is compensated exactly by the change in dQ'/dQ such that 
dA ■ dQ = dA' ■ dQ'. Note that the f /ratio of the optic sets the solid angle of the system. 
Therefore, it is important to match all optical elements in series such that the output of one 
element fllls the aperture of the next. On the basis of throughput alone, the Fabry-Perot 
is superior to grating instruments, which in turn have superiority over prisms. However, 
in practice, Fabry-Perots are normally used for high resolution observations over a narrow 
wavelength interval. A better comparison is with the Fourier Transform spectrometer 
(FTS) which, for the same bandwidth, has a much higher throughput than all slit-aperture 
spectrometers in the same conflguration. 

18.2.2. Resolving Power. The resolution of a spectrometer is set by the bandwidth 
limit imposed by the dispersing element. An elegant demonstration using Fourier optics 
is quoted by Gray [16]. When a beam of light passes through an aperture of diameter L, 
in the far fleld approximation, a Fraunhofer diffraction pattern arises.0 The width of the 
central intensity spike is proportional to X/L which is roughly 6X for most resolution (e.g. 
Rayleigh, Sparrow) criteria. Because the dispersing element deflnes a flnite baseline or 
aperture, the ultimate instrumental resolution is set by the diffraction limit. This has the 



^The radial diffraction pattern is a sinc^ function where sine y = siny/y. Recall that the sine function is 
the Fourier Transform of the rectangle ('top hat') function. 
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simple consequence that the highest spectroscopic resolutions have generally been obtained 
with large spectrometers. 

The theoretical value of TZ is rarely achieved in practice, not least because of optical 
and mechanical defects within the instrument. The width of the instrumental profile, i.e., 
the response of the spectrometer to a monochromatic input, must be matched carefully to 
the size of each detector element (or pixel) and the sampling interval defined by the angular 
dispersion. Gray [16] shows that an observed spectrum arises from the product of three 
functions, viz., Sx = Bx ■ lllx ■ Ox where Ox is the original spectrum, Bx is a rectangle 
function that defines the baseline of the dispersing element, and IIIa is a Shah function that 
describes the regular sampling. Because the Fourier Transform of the observed spectrum 
is the convolution of the transform of the individual functions, discrete sampling causes 
the transform of the original spectrum to replicate with a periodicity l/6x where 6x is 
the sampling interval. If the original spectrum is undersampled by the spectrometer, the 
adjacent orders of the transformed spectra overlap and cannot be disentangled uniquely. 
This problem of aliasing can be avoided by Nyquist sampling, i.e., sampling the source 
spectrum at twice the frequency of the highest Fourier component that you wish to study. 

18.2.3. Detector Constraints. Most contemporary spectrometers employ panoramic, 
electronic detectors because they are highly sensitive (more than 80% of the incident 
photons can be detected in many cases), provide digital output, have a linear response 
over a dynamic range of 10^, and record photons across a two-dimensional field. Several 
corporations now routinely manufacture low-noise, large area charge-coupled devices 
(CCDs): in the optical, 2048^ 15//m and 4096^ 7.5fiTa pixel arrays are now available; in the 
infrared, arrays of 1024^ pixels have been fabricated. The photo-sensitive area is currently 
limited to the 10 cm diameter of a silicon wafer, but suitably designed CCDs can be 
edge-butted to form large mosaics. A full description of the limitations and capabihties of 
modern CCDs can be found in [22]. At optical wavelengths, the largest detector formats 
are 35 cm photographic plates. The Kodak TechPan emulsions have a quantum efficiency 
close to 5% and an effective resolution element of roughly 5 fj,m. 

To minimize the thermal and coUisional excitation of electrons (which contribute 
a 'dark current'), arrays must be cooled to below 30 K for InSb infrared arrays (which 
therefore require inconvenient cryogcns) or to below 210 K for modern optical CCDs 
(obtained using thermoelectric coolers). Both require water-free atmospheric chambers to 
prevent frost, which restricts access to the focal plane. Uncorrelated noise sources combine 
in quadrature, so the effective signal-to-noise ratio counted at each pixel is 



for which ctr is the read noise (generated in the output amplifier, in rms electrons); s,yA,y, 
hyAy, and d are the number of electrons per second generated by the object, background, 
and dark current respectively in a frequency interval Aj,; t is the exposure time (seconds); 
and uj is the solid angle subtended by each pixel. The factor of two in the denominator 
assumes that the corresponding noise sources are measured on separate exposures, then 
subtracted from the data frame. Read noise on modern optical CCDs is a few electrons 
rms; infrared arrays are at least ten times noisier but their performance can be improved 
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Fig. 1. — A rudimentary Hadamard spectrometer and one-dimensional detector array. The 
only light to pass through the vertical entrance slit is through the mask holes aligned with 
the slit. After sliding each column over the slit, it is possible to reconstruct the spectrum at 
each spatial position along the slit even though the detector does not extend in this direction. 

with non-dcstructivc, multiple read out. In practice, the S/N ratio will be smaller than 
predicted by Eq. (1) because multiplicative ('flat flcld') and additive ('bias offset') gains 
must be determined empirically and applied to each detector element [23]. Except at very 
low and high flux levels, the CCD is a hnear detector so these calibration steps are often 
quite successful. 

18.2.4. Multiplex Advantage. In recent decades, the sense of what constitutes a multiplex 
(Fcllgett) advantage has evolved. The traditional meaning arises from single element or 
row element detectors which used to prevail at infrared wavelengths. With a single element 
detector, a two-dimensional image (either spatial-spatial or spatial-spectral) was made by 
scanning at many positions over a regular grid. The same image is more easily obtained 
with a one-dimensional detector array after aligning one axis of the image with the detector 
and then shifting the detector in discrete stages along the other axis. For a single element 
detector, Fellgett [24] realized that there is an important advantage to be gained by 
recording more than one spectral increment (channel) simultaneously if the signal detection 
is limited by detector (background) noise. If the receiver observes in sequence n spectral 
channels dispersed by a prism or grating for a total exposure time of r, the S/N ratio within 

each channel is proportional to y'r/n. So, a Fourier Transform device (see § 18.6), which 
observes all n spectral channels for the entire duration, has a spectral multiplex advantage 
of ^/n compared with conventional slit spectrometers. Thus, a multiplex advantage makes 
more efficient use of the available light. 

In principle, all spectroscopic techniques can achieve a multiplex advantage with the 
use of a multiplexed or coded aperture mask. Spectrometers with cylindrical symmetry (e.g. 
Fabry-Perots) use circular masks, while slit aperture devices (e.g. gratings) use rectangular 
masks. Fig. 1 illustrates the special case of Hadamard coded apertures. A cyclic mask Hij 
with n rows is placed at the entrance slit; a particular column can be aligned with the slit 
by sliding the mask. In this way, the dispersing element can be illuminated through each 
of the mask columns in turn. A one-dimensional detector array is aligned with the spectral 
dispersion so that each spectral channel receives all of the modulated signal through the 
slit at a discrete frequency. For each spectral channel, the modulated signal Mj = H^jOj 
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can be used to derive the original signal Oj at each position along the slit. This assumes 
that the (square) Hadamard array Hij can be inverted and that the influence of systematic 
errors can be minimized. The multiplex advantage is roughly if the mask columns 

have approximately half the holes open. 

With the advent of large format, low noise detectors in many wavebands, the 
conventional definition of the multiplex advantage is only significant for the highest spectral 
resolutions. It has become difficult to generalize under what conditions a multiplex 
advantage might prevail [11]. For certain applications, it can be inefficient to match the 
acceptance solid angle of the spectrometer to the imaging detector. As an example, optical 
fibers in astronomy — after major improvements in their blue to near-infrared response — 
have afforded a significant spatial multiplex advantage in recent years [25,26]. The light 
from discrete sources over a sparse field can be collected with individual fibers which are 
then aligned along a slit. The multiplex advantage, when compared with conventional slit 
spectrometers, is simply the number of fibers if the signal attenuation along the fiber length 
is negligible. 

18.2.5. Optics Design. All spectrometers make use of the constructive interference of light 
by dividing the wavefront (e.g. gratings), by dividing the amplitude of the wavefront (e.g. 
Fabry- Perots) , or by decomposing the wavefront into orthogonal polarization components 
(e.g. Lyots). The design of a spectrometer is in essence a competition between a dispersing 
element that deviates light into different angles according to wavelength, and optics 
that focus the light at the detector with minimum aberration. Two lenses in the optical 
path are normally sufficient to counteract the dominant aberrations [12]. In most cases, 
the dispersing element must be illuminated with parallel (collimated) light. Hence the 
acceptance angle (i.e. f/ratio) of the coUimating optic must be the same as that of the 
primary concentrating optics to transfer all the light. If the coUimated beam is too small, 
the Jacquinot advantage of the spectrometer is reduced [27,28]. If the collimated beam is 
too large, light is lost from the optical system. 

It is important to realize that all optical systems lose light at surfaces along the optical 
path. On occasion, the scattered light will simply leave the system. More often, the stray 
light finds its way back into the optical path to be imaged at the detector as a spurious 
'ghost' signal that can be difficult to distinguish from real signals. Scattered light can also 
dramatically increase the background signal at the detector, thereby reducing contrast and 
setting a limit on the sensitivity that can be reached within a given exposure time. The 
manner in which this happens is specific to the instrument. In later sections, we describe 
a few of the ghost families that occur within gratings and Fabry-Perot spectrometers. 
Anti-refiective coatings (including newly developed coatings whose index of refraction 
increases smoothly through their thickness), aperture stops, baffles and ingenious optical 
designs are part of the arsenal to combat these anomalies. 

In later sections, we describe diffraction gratings (§18.4), interference filters (§18.5.1), 
Fabry-Perot (§18.5.2) and Fourier Transform spectrometers (§18.6). We also discuss recent 
technological developments for selecting a bandpass whose central wavelength can be tuned 
over a wide spectral range. The most promising of these developments are those which 
utilise anisotropic media, particularly the Lyot filter (§18.5.3) and the acousto-optic filter 
(§18.5.4). 
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18.3. Prisms. 



Refractive prisms are no longer in common use as primary dispersers in slit 
spectrometers, although they are frequently used as cross dispersers in high-order 
spectroscopy and they play an important role in immersion gratings. However, prisms 
highlight some of the basic principles discussed in the previous section. Their operation 
is fully specified by Snell's law of refraction and the dispersive properties of the medium. 
A light ray incident on the face of a prism with refractive index nx deviates by an angle 
5 = i — r, where i is the angle to the face normal, and r is the refracted angle. If we take 
the external medium to be air, then sini = n\ sinr. For a restricted wavelength range, the 
Hartmann dispersion formula, nx = a + b{X — c)~^, provides a good approximation to many 
materials in the Schott glass catalog. The constants a, b and c depend on the material. 

Fig. 2(i) illustrates the angle convention for a ray passing through a prism. We note 
that 6i = ii — ri, 62 = 12 — ^2-1 and the prism apex angle, a = ri + r2, such that the deviation 
is given by ^ = (5i + ^2 = ii + ^2 — o; or equivalently 

9{a, A, ii) = ii — a -\- sin~^ [nx sm[a — sin~"^(?T,^''' sinii)]) (2) 



We can approximate the angular dispersion d9/d\ by finding the gradient between two 
discrete wavelengths Aq and Ai. If we plot {9{a, Ai, ii) — 9{a, Aq, ii))/(Ai — Aq)) over a range 
of ii for a flint prism, say, we find that it increases with a to a theoretical maximum at 
which point a ^ 74° and ii = 90° [14]. This corresponds to a ray at glancing incidence on 
the first face followed by a symmetric passage or, equivalently, minimum deviation through 
the prism. Alternatively, we can substitute ii with ri and differentiate to find 

9 , , ncosri ncosfa — ri) 

—9{a,X,ri) = -j= , (3) 

J 1 — n"^ sin^ ri Jl — nsm'^{a — ri) 



Thus, minimum deviation occurs when ri = r2 = |a which corresponds to a symmetric 
passage through the prism. At minimum deviation, we let i = ii = 12 and r = ri = r2- 
After substituting 9 — 2i — r and a — 2r, one finds 

d9 2siniQ; dn 

— — (4) 

^1 - n2 sin^ la dX 

The maximum apex angle is not used in practice because most of the light is reflected at 
the first surface. Manufacturers cut prisms to apex angles (e.g., 30°, 45°, 60°) that minimize 
wasted glass. The 60° prism is a good compromise for which the angular dispersion is 
approximately n dn/dX. This function rises rapidly to large and small wavelength cut-offs 
[11,21]. Thus, prisms disperse most efficiently at their absorption limits and therefore find 
use from ultraviolet to infrared wavelengths (100 nm— 60;um). A major disadvantage of 
prisms is the strongly non-linear angular dispersion. If the collimated beam is matched to 
the prism, it follows from Eq. (4) that TZ = B d9/dX = L dn/dX where L is the length of 
the prism base and B is the beam diameter. At grazing incidence, B = and therefore 
TZ — 0. Finally, for a large 10 cm glass prism, dn/dX ~ 10~^ nm~^, and TZ ~ 10^ which is 
roughly the practical limit of prism spectrometers. 
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18.4. Diffraction Gratings 

If a plane wave of wavelength A is incident at an angle a to the perpendicular of a 

periodic grating with groove spacing o", the outbound reflected beams at an angle /3 interfere 
constructively according to the grating equation 

(7(sin a + sin /3) = m\ (5) 

where m is the order of interference and the angle conventions are illustrated in Fig. 2(ii). 
It is common practice to make a > /3 to minimize scattered light. The angular dispersion 
follows by differentiating this equation 

dl^^r^ (6) 
dX a cos /3 

In many applications /? > 30°, which causes the angular dispersion to be slightly non-linear 
and therefore the wavelength scale at the detector plane must be calibrated from a reference 
spectrum. Standard ruled gratings have a = 1/600 to 1/1200 mm and are normally used 
in relatively low order (< 5). The longest wavelength accessible with a grating is 2a, so 
gratings in the infrared tend to be coarsely ruled. For a grating length L, the maximum 
resolving power increases as TZ = Lm/a and can exceed 10^. The spectral purity, 5\ 
depends on the collimator focal length, /cou, such that 

OA — — COSCK— (7) 

/coll m 

where w is the width of a detector element. 

Fourier optics demonstrates that the flux distribution in the focal plane is the Fourier 

power spectrum of the transmission (or reflection) function over the collimated beam. 
The grating acts as a filter that blocks all spatial frequencies except those associated with 
its groove frequency and the spatial frequency content of each groove [16]. The number 
of grooves and the maximum pathlength difference is set by the grating length L. The 
rectangle function that defines each grating groove produces a broad diffraction envelope 
(called the hlaze function) that modulates the flux of each spectral order, and whose width 
increases as the groove narrows (cf. Fig. 3). If we ignore the entrance sht, the wide rectangle 
function that defines the grating produces a high-frequency diffraction envelope (or cluster) 
at each spectral order which is modulated by the broad diffraction envelope. The width of 
each cluster decreases as the overall length of the grating increases. The maximum intensity 
of the secondary peak in a cluster is a fraction N'"^ of the central peak, where N (= L/a) 
is the total number of grooves in the grating. Note that the discontinuities in optical path 
length produced by the light-blocking strips between the slits or the steps between the 
mirrored facets provide the high spatial frequencies that are essential for shaping the profile 
by interference. 

18.4.1. Grating Fabrication. Plane reflective gratings are constructed by ruhng with a 
diamond tool either an aluminum or a gold coating on a low-expansion glass substrate. 
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To reduce costs, many epoxy-resin replicas are made from each glass master. Wear on 
the tool and changes in the operating environment during the grooving process limit the 
maximum ruled area to about 35 x 45 cm, but gratings can be mosaiced in much the same 
way as sihcon chips (§ 18.2.3). It takes several weeks of cutting at 10 grooves min~^ to 
fabricate a grating, during which no distance should drift by more than 20 nm. The actual 
resolution attained with a grating is smaller than calculated due to slow variations in the 
ruling density that can degrade coherence, broaden the wings of the instrumental profile, 
and produce spurious spectral line and continuum features at the detector [29, 30]. Periodic 
deviations in the engraved facets from flaws in the ruling engine produce periodic spectral 
features near bright spectral lines (Rowland ghosts). When one or more periodic deviations 
are present, interference of the ghost diffraction patterns can generate spurious lines far 
from the original spectral feature (Lyman ghosts). 

Holographic gratings do not suffer from these mechanical constraints and therefore can 
be fabricated with much higher groove densities (3500—6000 grooves mm^^). They are 
formed by imaging the interference pattern of a laser-fed Fizeau, Michelson (see §18.6) or 
Twyman Green interferometer onto a glass plate that has been covered with photo-resistive 
emulsion. The unexposed areas are etched away in an acid bath, and the resulting sinusoidal 
surface undulations form a grating. The profile of each facet must be squared to maintain 
high efficiency [31]. Another advantage of these gratings is that the astigmatism of an 
off-axis spectrograph can be compensated for by shaping the wavefronts of the interfering 
beams. However, all gratings produce at least two kinds of stray light [32]: a Lorentzian 
component predicted by diffraction theory, and a Rayleigh-scattered component due to 
microscopic surface defects. 

18.4.2. Influence of the Entrance Slit. The entrance aperture serves a dual function in 
slit spectrometers. First, it restricts flux to a particular region of the source and ensures 
that the flux is dispersed onto a uniform and low-level background that is imaged by 
the detector. Second, the recorded spectral lines are inverted images of the slit width, 
convolved by the grating profile and further broadened by aberrations within the optics. 
The choice of a slit width is a compromise between reduction of the source intensity or the 
intrinsic resolution of the grating. To avoid light loss by diffraction, the width cannot be 
reduced below about five times the operating wavelength. In practice, it is hard to make an 
adjustable slit narrower than 10 /im that maintains parallelism. 

The slit may actually be an aperture mask with many separate openings (slitlets) 
distributed across the focal plane but arranged so that spectra do not overlap at the 
detector. Even more flexibility is possible by positioning many optical flbers at widely 
distributed points in the focal plane [25,26]. Such 'fiber feeds' have been attached to existing 
spectrometers to greatly increase their efficiency for certain projects. One end of the fibers 
can either be attached to precut holes in a custom mask or can be moved to arbitrary 
positions by a robot arm. The other ends can then be lined up along the spectrograph slit 
so that the spectra do not overlap whatever the fiber position. It is common practice to 
reserve some of the fibers for direct observations of the background signal. If the source 
does not occupy the entire length of a slitlet, aperture masks produce better background 
sampling. This is because each fiber has a slightly different response: a fiber that sees 
only the background does not exactly match an adjacent fiber that is illuminated by the 
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combincd source and background signal. Fibers are also used as 'image scramblers' when 
it is important to minimize spectral artifacts that are introduced as a point source moves 
around in a large slit. 

The collimator f/ratio is fixed to that of the primary collecting optics for full 
illumination, while its diameter must be that of the grating to maintain the grating 
resolution. The ratio of the camera and the collimator focal lengths, /cam//coii, demagnifies 
the scale along the slit. The camera focal length is set to reduce the slit image width W 
down to the width of two detector elements 2w to satisfy the sampling theorem (§ 18.2.2), 
where 

cos fJ /coll 

The anamorphic factor cos a / cos l3 arises from the different beam sizes seen by the camera 
and the collimator. It is common for this factor to be less than unity to maximize spectral 
resolution, in which case the grating is more face-on to the camera than to the collimator. 
The camera diameter is also matched to the grating length which now fixes the f/ratio. 
Speeds faster than f/2 usually require catadioptric (Schmidt) cameras which have fairly 
inaccessible focal planes, although dioptric cameras are preferred if the detector obscures a 
significant fraction of the beam. Solid glass Schmidt cameras are often used for fast systems 
because the higher index of refraction allows the f/ratio to increase by the same factor. 

In many applications, the input scale is imposed by the source. This is particularly the 
case for large-aperture (> 3 m diameter) astronomical telescopes. The main optics produce 
an image scale at the entrance slit of 206265/(pDpr) (which is typically ^ 10 angular 
seconds of arc mm~^), where Dpj. (in mm) and p are the diameter of the primary optic 
and its f/number, respectively. At even the best sites, images are blurred by turbulence at 
8-10 km altitude to angular diameters of ;^0.6 seconds of arc, which forces both slits and 
fibers to be ;^100 fiia wide in order to pass most of the fight. Thus, for high values of TZ, a 
large, high density grating is required to compensate for the wide slit. This in turn requires 
large-diameter optics and a long focal length for the collimator. In addition, because CCD 
pixels are rarely larger than 25 /xm, the camera f/ratio must be about four times faster 
than the collimator to demagnify the image. Cassegrain beams commonly have p — 7 which 
implies very fast, complex, and expensive camera optics. New generation 8-10 m telescopes 
will use low-order adaptive optics to partially correct the distorted stellar wavefront for 
atmospheric blurring. This will reduce the core of the stellar image to 0.2 seconds of arc 
in diameter, allowing a smaller slit, grating, and optics with minimal light loss. A recent 
development has been to segment the telescope mirror (pupil image) or the focal plane 
image with microlens arrays. These generate many sub-images which are brought to focus 
onto a fiber bundle prior to the slit allowing a much simpler camera design. 

18.4.3. Transmission Grating. Transmission gratings are used mostly in conjunction with 
a prism ('grism') or a lens ('grens'), either in contact or air-spaced. Transmission gratings 
are commonly used as slitless systems which are only practical for spectroscopy of point 
sources on a weak background signal. Because no auxiliary optics are required, they can 
often be incorporated into an existing optical train to provide some wavelength selection 
while maintaining high throughput. The aberrations associated with a non-collimated 
beam (principally coma) can be minimized with suitable grating or detector tilts if the 
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linear dispersion is comparatively low. Note that the spectra are dispersed across the field 
of view. In slitless systems, the spectra are superposed on a variable background signal, 
and will overlap if the field is crowded with sources. 

18.4.4. Blazed Grating. Most high-efficiency gratings are reflective and have peak 
efficiencies near 80%. As the groove width a — b is decreased (see Fig. 2(ii)), the grating 
acts more like a plane mirror and concentrates the light into the non-dispersed m = 
order (Fig. 3). Time delays must be introduced across the grating to shift the peak of the 
diffraction envelope to an angle that corresponds to dispersive non-zero orders. Such blazing 
is easy to do on a reflective grating by grooving at an angle 5 to its normal. The change in 
effective width of the grooves broadens the grating diffraction peak asymmetrically, with 

a more abrupt decline in efficiency on the smaller wavelength side. Once again, the blaze 
function has the form sinc^7 for which 

nacosS, . , „ . , 

7 = [sm(/3 -8)+ sm(Q; - (9) 

A 

Grating manufacturers quote the blaze wavelength Aq for m = 1 and a = i3 = b (Littrow 
configuration); Aq and spectral order m are related by 7 = m7r(Ao — A)/A. However, the 
required exit angle /3b 7^ a, which shifts the blaze to the (typically 10%) smaller wavelength 

Ab = AoCos^(q; -/3b) (10) 

The wavelength of the blaze peak 7 = 1 is Ab = 2(Tsin 5 cos(q; — 5)/m when a -|- /3 = 25. 
The blaze curve drops to 40% of its peak value at the wavelengths A± = mAo/(m =F 72)- 

At high order, the wavelength range spanned by the blaze curve is small. When there is 
a need for large wavelength coverage with moderate resolution (7?. fa 10^~^), the gratings are 
operated in the echelle mode (§18.4.5). Off-the-shelf gratings are available with the blaze 
peak at one of several strong spectral lines. Because the blaze function is quite strongly 
peaked at low orders, the ability to manipulate the orientation of plane gratings accurately 
is an important design goal for an efficient spectrometer. This capability is nontrivial to 
provide within the sealed cryostat of an infrared spectrometer. In the infrared, it is also 
hard to eliminate unwanted higher orders that can extend into the visual. 

Because the grating constitutes a series of stepped mirrors, the blaze efficiency depends 
strongly on the polarization state [16] and strong discontinuities with amphtudes ;^20% 
(called Wood's anomalies) are often present close to the edges of the blaze distribution A±. 
Modern astronomical telescopes usually mount spectrometers at the Nasmyth foci, which 
are fed by a 45° tertiary mirror. Reflection off this mirror induces or alters the polarization 
of the incident light, and the varying polarization angle after the further reflection off the 
grating greatly complicates spectrophotometric calibrations. 

18.4.5. Echelle Grating. While echelles have a low density of grooves (~80 mm~^), they 
arc operated at a large spectral order m ^ 200 and high tilt to maintain a large path 
difference. The adjacent orders overlap, so they are cross-dispersed by directing the light 
through a prism or secondary grating that is oriented perpendicular to the echelle. In this 
way the different spectral orders are distributed as curved arcs across a two-dimensional 
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detector array. The main advantage of an echelle is that most of the detector area is used to 
record spectra (several orders m simultaneously, see Fig. 2(iii)). The large tilt of an echelle 
means that a rectangular mosaic of gratings is necessary for full illumination by a large 
coUimated beam. All of the separate glass substrates must have identically low coefficients 
of thermal expansion (e.g. Schott Zerodur or Corning ULE glass) to ensure a high TZ at the 
end of the several-week-long fabrication cycle of the gratings. 

Spatial information along the entrance slit is limited by the small separation between 
spectral orders. During data reduction, the different orders are extracted then overlapped 
to yield one spectrum that can span the entire sensitivity range of the detector while 
maintaining TZ ~ 10^~^. The blaze peaks when mX = 2crsinQ;, which corresponds to 
many wavelengths with adjacent spectral orders because m is large. The blaze curves of 
the different orders introduce undulations in the combined spectrum with large amplitude 
which must be removed by recording the known continuous spectrum of a calibration source 
with the same setup. The variations in S/N ratio along the spectrum can complicate the 
data analysis. 

18.4.6. Curved Grating. Curved gratings are used to avoid focus degradation from 
differential chromatic dispersion across a large wavelength range, when operating in a 
wavelength region where lenses are ineffective and mirror reflectances are low, or when it is 
desirable to make the spectrometer as compact as possible. The grating surface is figured 
to act as a collimator, a camera, or both. The gratings are one-of-a-kind and are therefore 
expensive, with limited flexibility in the choice of camera focal length to alter the projected 
slit widths and spectral coverage. In addition, because the grating is an off-axis mirror, 

it introduces astigmatism unless an auxiliary mirror is introduced. The magnitude of the 
astigmatism increases with the length of the rulings, and the entrance slit must be aligned 
precisely with the ruling direction to avoid degrading the spectrograph focus. Further 
discussion of grating mounts and spectrograph aberrations can be found in [11] and [12]. 

18.4.7. Immersion Grating. While transmission gratings (§18.4.3) have been largely 
superseded by other grating types (e.g. §18.4.4), prisms have found important uses in the 
context of immersion gratings [33] . Like the transmission grating, the ruling is placed on 
the downstream face of the prism, but now the coUimated beam is reflected and refracted 
along a direction close to the original path. The principle here is that the resolving power of 
the grating is increased by the refractive index of the medium [34] . Immersion gratings are 
particularly useful at infrared wavelengths where high index materials are readily available. 
Anamorphic immersion gratings [35] increase the resolving power still further by utilising 
highly wedged prisms to enhance the anamorphic factor of the coUimated beam (§18.4.2). 



18.5. Multiple-beam Interferometers 

18.5.1. Interference Filter. These monochromators allow a narrow spectral band-pass to 
be isolated. The principle relies on a dielectric spacer sandwiched between two transmitting 
layers (single cavity). The transmitting layers are commonly fused silica in the ultraviolet. 
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glass or quartz in the optical, and water-free silica in the infrared. Between the spacer and 
the glass, surface coatings are deposited by evaporation which partly transmit and reflect 
an incident ray. Each internally reflected ray shares a flxed phase relationship to all the 
other internally reflected rays. For a wavelength A to be transmitted, it must satisfy the 
condition for constructive coherence such that, in the mth order, 

mX = cos 9r (11) 

where 9r is the refracted angle within the optical spacer. The optical gap /x/ is the product 
of the thickness / and refractive index ^ of the spacer. An interference fllter is normally 
manufactured at low order so that neighboring orders spanning very different wavelength 
ranges can also be used [36]. Additional cavities, while expensive, can be added to decrease 
the band-pass or to make the fllter response more rectangular in shape. Either the glass 
material or an absorptive broadband coating is normally sufficient to block neighboring 
orders. 

Filter manufacturers normally provide data sheets that describe operation at room 

temperature and in a collimated beam. If the fllter is used in a converging beam, the 
band-pass broadens asymmetrically and the peak transmission shifts to shorter wavelengths. 
In the collimated beam, the peak transmission shifts to smaller wavelengths by an amount 
which depends on the off-axis angle. In either beam, the wavelength response of the 
interference fllter can be shifted slightly (tuned) to shorter wavelengths with a small tilt of 
the fllter to the optical axis. If A/ is the wavelength of a light ray incident at an angle Oi 
(Fig. 4(i)), then from Snell's Law and equation (11), it follows that 
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for which Ajv is the wavelength transmitted at normal incidence. To shift the band-pass to 
longer wavelengths, one must increase the fllter temperature and, typically, one can achieve 
0.2A K-i. 

A more versatile approach in constructing narrowband fllters is to use dielectric, 
multi-layer thin fllm coatings. A highly readable account of optical interference coatings 
is provided by Baumcister and Pincus [37]. One of the most successful of these is the 
quarter-wave stack in which alternate layers of high and low refractive index media are 
used. Through judicious combinations of refractive index and layer thickness, it is possible 
to select almost any desired bandwidth, reflectance and transmittance [38] . However, fllters 
with band-passes narrower than A/ 100 are difficult and costly to manufacture. 

In the next section, we discuss scanning Fabry-Perot etalons which, for the purposes 
of this review, use air gap spacers and are routinely operated at low and high order. There 
exists another class of interference fllters that is essentially a single cavity Fabry-Perot with 
a solid dielectric spacer. These etalon filters employ a transparent piezo-electric spacer, e.g. 
lithium niobate, whose thickness and, to a lesser extent, refractive index can be modifled by 
a voltage applied to both faces. Once again, tilt and temperature can be used to flnc tune 
the band-pass if it is important to keep the piezo-electric voltages modest. High quality 
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spacers with thicknesses less than a few hundred microns are difficult to manufacture, so 
that etalon filters are normally operated at high orders of interference. 

Any air-glass interface reflects about 4% of the incident light. This can be signiflcantly 
reduced (1% or better) by the application of an anti-reflective coating. In its simplest 
form, this constitutes a single A/4 layer of, say, MgF2 whose refractive index is close to 

(Fig. 4(i)). A multi-layer 'V-coat' with alternating layers of Ti02 and MgF2 dielectric 
coatings can reach reflectivity at speciflc wavelengths. Many advances in spectroscopic 
techniques in recent years have arisen from the refinement of multi-layer coatings. The 
most recent developments have used rare earth oxides and very thin (5A) metaUic layers. 
However, coating performance is currently limited by the availability of pure transparent 
dielectrics with high refractive indices. 

18.5.2. Fabry-Perot Spectrometer. There exists a wide class of multiple-reflection 
interferometers [36]. With the exception of interference filters, the Fabry-Perot remains 
the most popular in this class. Fabry-Perots are not true monochromators in the sense of 
interference or acousto-optic filters (§18.5.3) and, indeed, require an auxiliary low resolution 
monochromator to block out neighboring orders. Fig. 5 shows the simple construction of a 
Fabry-Perot spectrometer: an etalon d is placed in a coUimated beam between a collimator 
c and a camera lens e. The internal structure of the etalon is shown in Fig. 4. The image 
plane detector g resides within a cooled detector housing; light passes through a window /. 
An interference filter b is normally placed close to the focal plane a or in the collimated 
beam (Fig. 5(ii)). The etalon comprises two plates of glass kept parallel over a small 
separation I (Fig. 4(ii)), where the inner surfaces are mirrors coated with refiectivity 3?. 
The transmission of the etalon to a monochromatic source A is given by the Airy function 



where 6 is the off-axis angle of the incoming ray and /x/ is the optical gap. The peaks in 
transmission occur at mX — cos 9 where m is the order of constructive coherence. From 
this equation, it is clear that A can be scanned physically in a given order by changing 9 
(tilt scanning), fi (pressure scanning), or / (gap scanning). Both tilt and pressure scanning 
suffer from serious drawbacks which limit their dynamic range [39]. With the advent 
of servo-controlled, capacitance micrometry [40,41], the performance of gap scanning 
etalons surpasses other techniques. These employ piezo-electric transducers that undergo 
dimensional changes in an applied electric field, or develop an electric field when strained 
mechanically. In practice, the etalon plates are built to a characteristic separation or 
zeropoint gap about which they move through small physical displacements. The scan range 
is limited by the dynamic range of the transducers to roughly ±5 fim. For an arbitrary 
etalon spacing in the range of a few microns to 30 mm, it is now possible to maintain 
plate parallelism to an accuracy of A/200 while continuously scanning over several adjacent 
orders about the zeropoint gap [42]. 

The resolution of the Fabry-Perot is set primarily by the reflective coating that is 

applied to the etalon plates. The r'cflective flnesse is deflned as Nb, = itV^/(1 — 3?). Ideally, 
this constant defines the number of band-pass widths (spectral elements) over which an 
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etalon may be tuned without overlapping of orders. In other words, the finesse is the ratio 
of the inter-order spacing {free spectral range) and the spectral purity {AX/6X). The etalon 
is fully specified once the free spectral range and spectral purity are decided [43,44] because 
this sets the order of interference (m = A/AA), the resolving power {TZ — X/SX — ttiNe), 
and the zeropoint gap {Iq — A^/(2/xAA)). 

However, what is measured by the instrument is the effective finesse Ne which is 
always less than or roughly equal to the theoretical value of N^i. There are two effects 
that serve to degrade the theoretical resolution. If the beam passing through the etalon 
is not fully coUimated, the instrumental function broadens and its response shifts to 
smaller wavelengths. This is analogous to what happens to an interference filter in a 
converging beam [45]. The induced profile degradation is measm^cd by the aperture finesse, 
or Na = 27r/{mQ) where Q is the solid angle set by the f/ratio of the incoming rays. The 
profile degradation is negligible in beams slower than f/15. Another important source of 
degradation arises from defects in the fiatness of the etalon plates. The defect finesse is 
given by Nj:, = A/(2 51) where 61 is the rms amplitude of the micro-defects. In practice, the 
defect finesse will also include terms for large-scale bowing and drifts in parallelism [39]. 
In essence, to realize an effective finesse of 50 requires that the spacing and parallelism be 
maintained to A/ 100. In summary, the effective finesse is set by the characteristics of the 
etalon surfaces, for which we can write l/iV| = l/iV|. + 1/N\ + l/iV|,. 

There are two primary methods for extracting spectral information from imaging 
Fabry-Perot spectrometers. The interferogram or areal method measures the perturbations 
in the radii of the interference rings to determine spectral differences across the field of 
observation. Roesler [46] shows that if the radius r of the A;th ring has been perturbed by 
5, then the corresponding fraction of a free spectral range is (r^.,.^ — T^)/(r|_,_j^ — r^). While 
this method is of historical importance, it is now more convenient to assemble the image 
frames into a three-dimensional data stack and to form spectra along one axis. In this 
spectral method the etalon is scanned to obtain a sequence of narrowband images taken over 
a fixed grid of etalon spacings. As the gap is scanned, each pixel of the detector maps the 
convolution of the Airy function and the filtered spectrum at that point [43,44]. 

It is possible to radically alter the resolving power and free spectral range by scanning 
etalons in series simultaneously [46]. In principle, all possible combinations of a mixture of 
3 — 5 high and low finesse etalons can be used to mimic the effect of a tunable filter. While 
it is still necessary to use a broadband filter to block unwanted orders, only a handful are 
needed to cover the full optical spectrum. However, to use a Fabry-Perot as a tunable 
filter without the necessary phase correction requires that we restrict the observations to 
the Jacquinot central spot. This is defined as the field about the optical axis within which 
the peak wavelength variation with field angle does not exceed of the etalon band-pass 
[21]. A good discussion of Fabry-Perot based tunable filters is given by [39]. The basic 
principle is to use the etalon in low order so that widely different wavelength regions are 
accessible by using the adjacent orders. At optical and near-infrared wavelengths, Eq. (11) 
indicates that to reach the lowest orders requires gap spacings of only a few microns. A 
major obstacle, however, is that special techniques are now needed to deposit non-laminar 
dielectric coatings [38]. There have been recent advances in maintaining A/200 parallelism 
over plate separations of 30 mm down to 1.8 /xm [39], allowing for free spectral ranges 
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of 0.05—50 nm at optical wavelengths. Presently, techniques are under investigation for 
purging incompressible dust grains from the etalon air gap. Thus, in principle, it should be 
possible to operate several etalons at low order, both separately and in tandem, to simulate 
a variable band-pass tunable filter over the visible or near-infrared spectrum. However, 
the design, manufacture and stability of extended bandwidth coatings remain important 
obstacles for broadband tunable filters. 

Even a minimal Fabry-Perot arrangement can have eight or more optically flat surfaces. 
At some level, all of these surfaces interact separately to generate spurious reflections. The 
periodic behavior of the etalon requires that we use a narrowband filter somewhere in the 
optical path. Typically, the narrowband filter is placed in the converging beam before 
the collimator or after the camera lens (Fig. 5(i)). The filter introduces ghost reflections 
within the Fabry-Perot optics (Fig. 5(ii)). The pattern of ghosts imaged at the detector is 
different in both arrangements, as illustrated in Fig. 5. The dominant reflections are mostly 
deflected out of the beam by tilting the etalon through a small angle with respect to the 
optical axis. A more difficult problem arises from the optical blanks which form the basis 
of the etalon. These can act as internally reflecting cavities (cf. Fig. 4(i)) that generate a 
high order Airy pattern at the detector [18,43]. Traditionally, the outer surfaces have been 
wedge-shaped to deflect this spurious signal out of the beam (Fig. 4(iii)). Even curved 
lens surfaces occasionally produce 'halation' around point source images which may require 
experimenting with both bi-convex and plano-convex lenses when designing a focal reducer. 

18.5.3. Birefringent Filter. The underlying principle of birefringent filters is that light 
originating in a single polarization state can be made to interfere with itself [47]. The 
Michelson interferometer (§18.6) achieves interference by splitting the input beam and 
sending the rays along different path lengths before recombining them. By analogy, an 
optically anisotropic, birefringent medium can be used to produce a relative delay between 
ordinary and extraordinary rays aligned along the fast and slow axes of the crystal. (A 
birefringent medium has two different refractive indices, depending on the plane of light 
propagation through the medium.) Title and collaborators have discussed at length the 
relative merits of different types of birefringent filters [see references in 47]. The filters are 
characterised by a series of perfect polarizers (Lyot filter [48,49]), partial polarizers, or only 
an entrance and an exit polarizer (Sole filter [50]). The highly anisotropic off-axis behaviour 
of uniaxial crystals give birefringent filters a major advantage. Their solid acceptance angle 
is one to two orders of magnitude larger than is possible with interference filters (§18.5.1) 
although this is partly offset by half the light being lost at the entrance polarizer. To our 
knowledge, there has been no attempt to construct a birefringent filter with a polarizing 
beam-splitter, rather than an entrance polarizer, in order to recover the lost light. 

The Lyot filter is conceptually the easiest to understand. The entrance polarizer 
is oriented 45° to the fast and slow axes so that the linearly polarized, ordinary and 
extraordinary rays have equal intensity. The time delay through a crystal of thickness d of 
one ray with respect to the other is simply d A/x/c where A/x is the difference in refractive 
index between the fast and slow axes. The combined beam emerging from the exit polarizer 
shows intensity variations described by J^cos(27r d Ayu/A) where / is the wave amplitude. 
As originally illustrated by Lyot [49] , we can isolate an arbitrarily narrow spectral band-pass 
by placing a number of birefringent crystals in sequence where each element is half the 
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thickness of the preceding crystal. This also requires the use of a polarizer between each 
crystal so that the exit polarizer for any element serves as the entrance polarizer for the 
next. The resolution of the instrument is dictated by the thickness of the thinnest element. 

The instrumental profile for a Lyot filter with s elements is 

^ _ 1 sin^ (2V d Afx/X) 
~ 4* sin^ (tt d A/i/X) 

for which d is now the thickness of the thinnest crystal element. By analogy with the 
Fabry-Perot (§18.5.2), if Aq is the wavelength of the peak transmission, the filter bandwidth 
(= 0.88Ag/(2'*(i A/x)), the free spectral range (= Xl/{d A/x)) and the effective finesse 
(= 1.13 2'^) are easily derived. 

It should be noted that Aq can be tuned over a wide spectral range by rotating the 
crystal elements. But to retain the transmissions in phase requires that each crystal 
element be rotated about the optical axis by half the angle of the preceding thicker crystal. 
The NASA Goddard Space Flight Center have recently produced a Lyot filter utilising 
eight quartz retarders with a 13 cm entrance window. The retarders, each of which are 
sandwiched with half-wave and quarter-wave plates in addition to the polarizers, are rotated 
independently with stepping motors under computer control. They achieve a band-pass of 
4— 8A tuneable over the optical wavelength range (3500-7000A). 

18.5.4. Acousto-optic Filter. In 1969, Harris and Wallace introduced a new type of 
electronically tunable filter that makes use of coUinear acousto-optic diffraction in an 
optically anisotropic medium [51]. Acousto-optic tunable filters (AOTF) are formed by 
bonding piezo-electric transducers such as lithium niobate to an anisotropic birefringent 
medium. The medium has traditionally been a crystal, but polymers have been developed 
recently with variable and controllable birefringence. When the transducers are excited at 
frequencies in the range 10-250 MHz, the ultrasonic waves vibrate the crystal lattice to 
form a moving phase pattern that acts as a diffraction grating. A related approach is to 
use liquid crystals made of nematic (anisotropic) molecules. When a transverse electric 
field is applied, the molecules align parallel to the field because of their positive dielectric 
anisotropy and form a uniaxial birefringent layer. In acousto-optic filters, the incident light 
Bragg-scatters off the moving pattern from one polarization state into its orthogonal state. 
The birefringent interaction permits a large angular aperture which is unattainable with 
isotropic Bragg diffraction. If the crystal is thick enough and the driving power high enough 
(several watts for each cm^ of aperture), only a limited band of optical frequencies in the 
incident light is cumulatively diffracted for a given acoustic frequency. The wavevector of 
the diffracted output beam is the vector sum of the input wavevector and the acoustical 
wavevector; out-of-band frequencies remain undeviated. Crystals such as Te02, quartz 
and MgF2 are highly transmitting and have an efficient acousto-optic response in different 
wavelength regimes from 200 nm to 5 /xm. For currently attained crystal homogeneity and 
thickness, TZ can reach 10^ and out-of-band fight is rejected down to a level of about 10"^. 

In the more useful non-coUinear filters [52,53], the acoustic and optical wave vectors 
differ in such a way that the phase differences introduced by variations in the angle of 
incidence can be approximately compensated by the different refractive indices of the 
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ordinary and extraordinary rays rio and rig, respectively. For an extraordinary polarized 
incident beam, = ((cos ^^j/no)^ + (sin ^^i/ne)^)^"*^/^. If the incident angle is 6i, the vacuum 
central wavelength is Aq, and the acoustic wavelength is A, the diffracted angle 9d depends 
only weakly on A and is given by solving 

= + - luoUi cos[0i - 9d{\)\ (15) 

If iaiiOfi = {rio/neY iaiiO-i (the non-critical phase matching configuration), and the 
interaction length is short enough, the acceptance angle for diffraction-limited imaging 
can increase to almost 28° (i.e. an f/2 beam) to provide throughput comparable to that 
of a Fabry- Perot. However, TZ is then limited by the small attainable density of the 
acoustic driving power. The diffracted ordinary and extraordinary rays emerge displaced 
to either side of the undiffracted out-of-band fight by angles of 1°-10° (Fig. 6). The 
undeviated out-of-band fight is removed with an aperture mask in the collimatcd beam. It 
is particularly difficult at infrared wavelengtfis to keep tfiis ligfit from scattering back into 
tfie optical patfi, tfiereby increasing tfie background. Wfiile tfie corresponding f/numbers of 
50 to 5 are too slow for imaging spectroscopy of low surface-brightness sources, for brighter 
objects these filters simultaneously deliver the two orthogonally polarized images with more 
than 65% efficiency and so are reliable and compact imaging spectTopolarimeters [54]. In 
beams as fast as f/2, the incident light can be sent through a linear polarizer in front of the 
filter, and a crossed polarizer removes one of the deviated beams after passage through the 
crystal for a net throughput in excess of 30%. 

The instrumental profile of an acousto-optic filter of length L that operates at a central 
wavelength Aq in a coUimated beam is like that of a grating, 

T^Tosinc^^, (16) 

with FWHM 

_ O.QvrAg _ 2nn 

- Ld^, - Ak' ^^^^ 

where L is the interaction length, n is one index of refraction, Ak is the mismatch between 
optical and acoustic wavevectors, and 9i is the incident angle relative to the crystal optical 
axis. The dispersive term, d{X), becomes very large near a band edge. The band-pass 
can be altered rapidly across a large wavelength range merely by tuning the power and 
frequency of the acoustic wave, to form a composite band-pass shape with widely separated, 
broad multiple peaks. Such spectral multiplexing across several selected pass bands is a 
unique capability of acousto-optic filters. 

These filters can be used alone as moderate-resolution imaging spectrometers, even in 
the ultraviolet and infrared where it is extremely difficult to produce interference filters with 
band widths narrower than A/ 100 which also have high transmission and good off-band 
rejection. Alternatively, they can be used in a collimated beam as the order-sorting filter of 
a Fabry-Perot etalon. The spatial resolution can be as good as 15 /xm, well matched to the 
size of CCD detector elements. Crystal performance does not appear to deteriorate with 
age, unlike multi-layer coatings used with interference filters and Fabry-Perot etalons. 
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Current disadvantages include their expense, long fabrication time, and small size (<25 
mm square for crystals that are uniform enough for good imaging) relative to interference 
filters, restrictions that are likely to be lifted as the commercial market develops. Another 
concern that is particularly acute in the infrared is the power dissipated and heat generated 
during their operation. Nonetheless, an acousto-optic filter today is often much cheaper 
than a comparably performing Fabry-Perot and control electronics, and also avoids the 
complications of selecting among the multiple spectral orders that characterize etalons. 



18.6. Two Beam Interferometers 



All spectroscopic techniques rely ultimately on the interference of beams that traverse 
different optical paths to form a signal. The prism uses essentially an infinite number of 
beams whereas the grating uses a finite number of beams set by the number of grooves. 
The Fabry-Perot uses a smaller number of beams set by the instrumental finesse. Bell [19] 
notes that as the number of beams decreases, the throughput (and therefore efficiency) of 
the spectrograph increases. Because at least two beams are required for interference. Bell 
concludes that two-beam interferometers are the ultimate in spectrometers. The two most 
commonly used Fourier Transform devices divide either the wavcfront (lamellar grating 
interferometer) or the wave amplitude (Michelson interferometer). The efficiency of the 
latter is ^50% whereas the former technique can approach 100%. Lamellar gratings are 
discussed in [13]. 

A simple two-beam Michelson interferometer is shown in Fig. 7 and forms the basis 
of the Fourier Transform spectrometer. The collimated beam is split into two beams at 
the front surface of the beam-splitter. These beams then undergo different path lengths 
by refiections off separate mirrors before being imaged by the camera lens at the detector. 
The device shown in Fig. 7 uses only 50% of the available fight. It is possible to recover 
this light but the layout is involved [55,56]. For all systems, the output signal is a function 
of path difference between the mirrors. At zero path difference (or arm displacement), the 
waves for all frequencies interact coherently. As the movable mirror is scanned, each input 
wavelength generates a series of transmission maxima. Commercially available devices 
usually allow the mirror to be scanned continuously at constant speed, or to be stepped 
at equal increments. At a sufficiently large arm displacement, the beams lose their mutual 
coherence. 

The spectrometer is scanned from zero path length (x = y = 0) to a maximum path 
length y — L set by twice the maximum mirror spacing (x — L/2). The superposition of 
two coherent beams with amplitude bi and 62 in complex notation is bi + 626*^'^''^ where y is 
the total path difference and v is the wavenumber. If the light rays have the same intensity, 
the combined intensity is 26^(1 -|- cos 27ih'y), or equivalently, 46^ cos^ TTi/y, where b — hi — 62- 
The combined beams generate a series of intensity fringes at the detector. If it was possible 
to scan over an infinite mirror spacing at infinitesimally small spacings of the mirror, the 
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superposition would be represented by an ideal Fourier Transform pair, such that 

/oo 
B{u){l + cos27ruy) du (18) 
-oo 

/oo 
b{y){l + COS 2niyy) dy (19) 
-oo 

where b{y) is the output signal as a function of pathlcngth y and B[i') is the spectrum we 
wish to determine. -B(z^) and b{y) are both undefined for v < and y < 0: we include the 
negative limits for convenience. Notice that 

1 f°° 

b{y)--b(0) = / B{v)cos2TTvy dv (20) 

2i J —oo 

B{u) = / [b{y)--b{0)]cos27ruydy (21) 

J —oo ^ 

The quantity b{y) — |6(0) is usually referred to as the interferogram although this term is 
sometimes used for b{y). The spectrum S(i/) is normally computed using widely available 
Fast Fourier Transform methods. 

It is clear that the output signal is sinusoidal about some mean continuum value. If 
the wavefronts have different intensities, the depth of the modulation decreases and the 
mean continuum level increases. This is undesirable because the background continuum 
constitutes a source of noise. The contrast of the fringe amplitude with respect to the 
background is known as the fringe visibility. It is particularly important to ensure that all 
rays undergo the same optical path, otherwise the output signal is asymmetric with respect 
to zero path length. In Fig. 7, the beam that reflects off the movable mirror passes through 
the beam-splitter three times. The compensating plate ensures that the beam that reflects 
off the flxed mirror undergoes the same optical path. 

Efficient beam-splitters are crucial to the operation of an FTS device. These often 
comprise dielectric sheets, wire grids or multi-layer dielectric coatings on substrates 
(§ 18.5.1), depending on the wavelength of operation. Particular care must be taken over 
the internally reflecting rays. Typically, the primary transmitted ray dominates and the 
secondary transmitted ray can be neglected. However, the primary and secondary reflected 
rays (cf. Fig. 4(i)) are comparable in intensity. To maximize peak efficiency this requires a 
beam-splitter which maintains constructive interference between the rays over as much of 
the wavelength range as possible. 

In practice the ideal Fourier Transform pair is not realized. The finite maximum 
baseline y = L set by the maximum mirror displacement ultimately limits the instrumental 
resolution although, under certain circumstances, the size of the source can impose 
a stronger constraint [19]. The response of most interferograms declines with higher 
wavenumbers. At some intermediate value of y, the signal-to-noise ratio may fall to 
unacccptably low levels, in which case the effective resolution may be somewhat lower than 
the theoretical value. The FTS is scanned at discrete sampling intervals (mirror spacings) 
which causes the computed spectrum to replicate at wavenumber intervals that are inversely 
proportional to the samphng interval (§ 18.2.2). If the baseline L/2 is sufficiently large to 
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resolve all details within the spectrum, the replicated spectra will not overlap. Thus, the 
integrals in Eqs. (18) and (19) should be replaced by summations over finite limits. 

The optical alignment of an FTS is particularly involved [19]. Traditionally, this is 
done by using, say, a HeNe laser along the optical axis and aligning the beam-splitter with 
the movable and fixed mirrors separately and then together. The mirrors must be aligned 
and sufficiently flat so as to introduce no wavefront errors greater than A/8 [13]. This is a 
major opto-mechanical challenge over the large arm displacements (^1 m) of a two-beam 
interferometer. In some respects, the Fabry- Perot constraint of X/{2N) is easier to meet 
because the plates are optically contacted at a physical spacing of, say, 1 mm, and then 
scanned through a few orders about this spacing, a total mechanical distance of only a few 
microns. There are many ways to introduce phase errors into the computed spectrum. This 
is particularly so if the stepping does not start at the precise position for the zero path 
difference. 

While attention has been given to the difficulties of two-beam interferometry, the 
disadvantages are few when compared with other spectrometers. In particular, the 
instrumental profile can take any form after mathematical filtering. The interferogram of a 
single wavelength is given by b{y) = B^uq) cos27ri/o2/- The expected spectrum is then 

y=L 

S{v) — B{vq) ^ cos27ri/oy cos 27rpy 6y (22) 

y=-L 

It is straightforward to show that this reduces to 

S{iy) = B{uq) L (sine 2t:{uq - p)L + sine 2ti{uo + u)L) (23) 

Because the second term is negligible (<1%) at optical and infrared wavelengths, the basic 
instrumental profile is a sine function. This has the highest resolution possible for a baseline 
of L. 

At the expense of resolution, the convolution theorem [57] allows one to modify 
digitally the instrumental response, i.e., for an arbitrary digital filter f{y), we can write 

S{v) = fJ[f{y)h{y)] = F{v)^ B{v) (24) 

where ★ denotes convolution, FT is the Fourier Transform operator and F{v) = FT[/(y)]. A 
common reason to modify the instrumental response is to reduce the side lobes of the sine 
function, a process known as apodization [57]. A list of apodizing functions is given in [19]. 
In particular, if the interferogram is convolved with a triangle function, f[y) — 1 — \y\/L, 
the instrumental profile now takes the form of a sinc^ function which is the response of both 
the grating (§18.4) and the acousto-optic filter (§18.5.4). 
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Fig. 2. — (i) A rudimentary prism spectrometer, (ii) Angle and groove definitions for a 
conventional, reflective diffraction grating, (iii) A blazed reflection-grating spectrometer. 
The standard grating and echelle arrangements are shown as insets, where the dashed-line 
vectors g and b are the normals to the grating and blaze facets, respectively. The spectral 
format of the echelle on the detector assumes that the different orders have been displaced 
as shown by auxiliary optics. 
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Fig. 3. — Instrumental response of a diffraction grating with blaze angle 5 = 0. BF is the 
sinc^ blaze function and IF is the interference response to a monochromatic source. When 
(7 = 6, only the m = order passes flux because the other peaks coincide with the BF zeros. 
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Fig. 4. — (i) Interference filter: the internal structure is not shown, (ii) Fabry-Perot etalon. 
(iii) Wedged Fabry-Perot etalon. 
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Fig. 5. — Ghost families arising from internal reflections within a Fabry-Perot spectrometer, 
(i) Diametric ghosts. Rays from the object O form an inverted image I and an out-of-focus 
image at R3. The reflection at Ri produces an out-of-focus image at R2. The images at R2 
and R3 appear as a ghost image G at the detector, (ii) Exponential ghosts. The images at 
R2 and R4 appear as ghost images Gi and G2 respectively. 



-27- 




Fig. 6. — A crystalline, non-coUinear acousto-optic tunable filter. 9i is the angle of incidence 
relative to the optic axis, 9a the angle between the acoustic wave and optic axis, and ^d(A) 
the diffraction angle. The beams separate when they exit the filter. 




Fig. 7.- 



Two-beam Michelson interferometer (Fourier Transform spectrometer). 



